Story Segmentation of Broadcasted Sports Videos with Intermodal Collaboration
نویسندگان
چکیده
This paper investigates the problem of efficiently describing broadcasted sports videos for effective multimedia applications. Considering the sports videos as a sequence of recurrent semantic story units, we propose a method for segmenting the sports videos into the story units and attaching the closed-caption segments, which correspond to the story units, as the detailed descriptions. This proposed method restricts the use of much domain-dependent information and can be used to acquire the semantic content. We first try to segment the closed-caption text into scene units, a set of which comprises a story unit, in a probabilistic framework based on Bayesian Networks. Finding the boundaries of the set of the scene units enables us to generate the story units in the closed-caption text. Template matching in the image stream also segments the video stream into video story units. Finally, temporal association attaches the appropriate closed-caption story unit, which includes the detailed information about semantic content, to each segmented video story unit. We conduct experiments using American football and baseball videos, obtaining successful story segmentation results, a recall rate of 92.5% and a precision rate of 91.5%, and we also discuss the potentiality for utilizing them for a video retrieval system.
منابع مشابه
Automatic Story Segmentation of Closed-Caption Text for Semantic Content Analysis of Broadcasted Sports Video
Sports videos can be characterized as a sequence of recurrent semantic story units. Storing sports videos in this story-unit-based form will lead to develop an intelligent content-based retrieval, browsing, and summarization system. The storage requires segmentation of videos and semantic understanding of each segment. Since transcribed broadcasted video speech, the closed-caption text, can be ...
متن کاملIntermodal collaboration: a strategy for semantic content analysis for broadcasted sports video
This paper presents intermodal collaboration: a strategy for semantic content analysis for broadcasted sports video. The broadcasted video can be viewed as a set of multimodal streams such as visual, auditory, text (closed caption) and graphics streams. Collaborative analysis for the multimodal streams is achieved based on temporal dependency between their streams, in order to improve the relia...
متن کاملCollaborative Multimedia Analysis for Detecting Semantical Events from Broadcasted Sports Video
In this paper, we present an approach towards detecting semantical events from broadcasted sports video through collaborative multimedia analysis, called intermodal collaboration. Broadcasted video can be viewed as a set of multimodal streams such as visual, auditory, and textual (closed caption: CC) streams. Considering temporal dependency between their streams, we aim to improve the reliabili...
متن کاملEvent Based Video Indexing by Intermodal Collaboration
In this paper, we propose event based video indexing, which is a kind of indexing by semantical contents. To achieve this, we exploit the idea of intermodal collaboration, i.e. collaborative processing taking account of the semantical dependency between multimodal information streams consisting of visual, auditory, and text (closed caption: CC) streams. The proposed method attempts to make temp...
متن کاملStory based representation for broadcasted sports video and automatic story segmentation
This paper presents a model to represent a broadcasted sports video in a semantical way and proposes a method to segment the sports video into the semantical units for the representation. Representation of a video should clarify its semantical content as accurately as possible. Our model structurizes the video and gives the suitable semantical descriptions to its particular time locations based...
متن کامل